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Abstract 

The arbitrarily varying channel (AVC) is a conservative way of modeling an unknown in- 
terference, and the corresponding capacity results are pessimistic. We reconsider the Gaussian 
AVC by relaxing the classical model and thereby weakening the adversarial nature of the inter- 
ference. We examine three different relaxations. First, we show how a very small amount of 
common randomness between transmitter and receiver is sufficient to achieve the rates of fully 
randomized codes. Second, akin to the dirty paper coding problem, we study the impact of an 
additional interference known to the transmitter. We provide partial capacity results that differ 
significantly from the standard AVC. Third, we revisit a Gaussian MIMO AVC in which the 
interference is arbitrary but of limited dimension. 

1 Introduction 

The arbitrarily varying channel is an information-theoretic model of communication under worst- 
case noise [6j[7]. In the Gaussian AVC (GAVC) [8J an additive white Gaussian noise (AWGN) 
channel is modified by adding a power-constrained jamming interference signal. As in the discrete 
AVC with constraints [9], the capacity is well defined when the power constraints T and A on the 
input and jammer are required to hold almost surely. When the encoder and decoder share com- 
mon randomness, the jammer can be no more harmful than Gaussian noise, but without common 
randomness the capacity is zero when T < A because the jammer can simulate the encoder and 
"symmetrize" the channel. If T > A then under average error the jammer is again no worse than 
Gaussian noise. 

The GAVC gives one way of understanding the impact of uncertainty on the capacity of point- 
to-point channels. However, the dichotomy for deterministic coding reflects the effect of worst- 
case analysis. By contrast, by assuming the interference comes from power- limited but arbitrary 
random noise, it can be shown that the worst-case noise is Gaussian and the capacity is the AWGN 
capacity [10|,ll 1 j . Similarly, by allowing feedback and causal coding, arbitrary interference in an 
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"individual channel" model may also look Gaussian [12J. In this paper we reexamine the GAVC 
model to see how different variants of the model shed insights into what can be achieved against 
"worst-case" interference. 

We describe three variants of the GAVC model: 

1. In the first model we allow the encoder and decoder to share a limited amount of common 
randomness. In particular, we show that O(logn) bits of common randomness are sufficient 
to achieve the randomized coding capacity of the AVC, where n is the coding blocklength. 
Essentially, a small amount of randomness is sufficient to make the most malicious interference 
as harmless as random noise. 

2. In the second model, in addition to the noise and the jammer, there is an additional in- 
terference which is known only to the transmitter. A new achievable rate is found for this 
setting. Perhaps somewhat surprisingly, the presence of this additional interference increases 
the capacity, helping to beat the jammer. Capacity results are found for special cases. 

3. The third model is the Gaussian MIMO AVC under fully randomized coding [13]. In addition 
to a power constraint, we also constrain the dimensionality of the jamming signal. In general 
this leads to higher rates, and we find the exact capacity for the case of 2 transmit and 2 
receive antennas under interference from a single- antenna jammer. 

More generally, these relaxations of the Gaussian AVC shed some light on the nature of worst- 
case interference. There are two ways in which the jammer behaves in a worst-case manner. The 
capacity dichotomy for deterministic coding arises because the jammer can simulate the valid 
encoder. When the capacity is positive, it is limited by the jammer choosing the worst noise distri- 
bution. Our work shows that if the jammer cannot implement these strategies, the corresponding 
capacity is often higher. Because these three models are somewhat different from each other, we 
introduce the relevant definitions with the results^. 

2 Limited Common Randomness 

Our first relaxation of the GAVC is a classic one - we allow the encoder and decoder to share 
common randomness that is unknown to the jammer [Sj. However, in contrast to previous works, 
we focus on the amount of randomization, or key size. Our main result is that as in the discrete 
case, we can use a sub-exponential number (in the blocklength n) of codebooks to obtain an 
asymptotically decreasing upper bound on the probability of error. 

2.1 Channel model 

The Gaussian AVC is shown in Figure [TJ For an input sequence x E W 1 the output of the Gaussian 
AVC is given by 

Y = x + s + W. 

1 Note to reviewers: For ease of reviewing, the proofs are provided in the relevant sections. However, it is our 
intention to move most of the technical lemmas to the appendix so that the main body of the paper can be read 
easily. 
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Figure 1: The Gaussian arbitrarily varying channel under randomized coding. 



The input is corrupted by iid additive white Gaussian noise W with variance and an unknown 
interference vector s. The input signal x and jammer signal s are constrained in power: 

1 II l|2 ^ -n 
— X < 1 
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Define 

S n (A) = {s: ||s|| 2 <™a}. (1) 

An (n, N) deterministic code C satisfying the input constraint T is a pair of maps ((ft, ip) with 

<ft : [N] -> R n 
ift : R n [N], 

such that for all i G [N] we have ||</>(i)|| 2 < nr. An (n,N) randomized code C satisfying the input 
constraint T is a random variable taking on values in the set of (n, N) deterministic codes. It is 
written as a pair of random maps \P) where each realization is an (n, N) deterministic code 
satisfying the constraint T. If ($, almost surely takes values in a set of K codes, then we call 
this an (n, N, K) randomized code. The number K is called the key size of the randomized code. 
In this section we will consider randomized coding and maximal probability of error: 

e(C, s) = maxEc [P W (*(*(«) + s + w) ^ *)] (2) 

i£[N] 

e(C)= max £ (C,s). (3) 

sg5 n (A) 

A rate R is achievable under maximal error with randomized coding if there exists a sequence 
of (n, [exp(ni?)]) randomized codes whose maximal error goes to as n — > oo. The randomized 
coding capacity under maximal error C r (T,A, a^) is the supremum of the achievable rates under 
maximal error with randomized coding. We will write C r when the parameters are clear. 

2.2 Main results 

Hughes and Narayan [8] showed that if the input and jammer are both bounded in power almost 
surely and the random variable C is unconstrained, then the capacity is equal to that of an additive 
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white Gaussian noise (AWGN) channel with the jammer treated as additive noise: 

a( r,A,0 = i 1 „ g (i + ^). (4) 

Csiszar and Narayan [14J showed that if only deterministic codes are allowed and the error criterion 
is replaced by the average probability of error: 

1 N 

£ "( C ) = ^E P W WW + 8 + W) ^ i) , 

1=1 

the capacity is equal to ([4]) if and only if the encoder has a higher power limit than the jammer: 

Q(r,A,o = | c r (r,A,4) r>A. (5) 

Recall that f(n) = 0(g(n)) means there is a constant c such that f(n) < cg(n) for sufficiently large 
n. Our main result is to show that if log K(n) = 0(log n), for any e c > the rate C r —e c is achievable 
using randomized codes with key size K{n). That is, O(logra) bits of common randomness is 
sufficient to achieve the randomized coding capacity of the GAVC. 

Theorem 1. The randomized coding capacity C r (F, A, cr^) of the GAVC is achievable using ran- 
domized codes whose key size satisfies log K(n) = O(logn). 

2.3 Analysis 

The class of randomized codes we consider can be built in two steps. Similar to the discrete AVC 
construction in [15], we "modulate" a single Gaussian codebook. Let N = exp(ni?) and M be an 
arbitrary integer. 

1. Let B = {xi,X2, . . . ,xtv} be a set of N vectors on the sphere of radius y/nT. We can choose 
this set to have small maximal error for both the AWGN channel with noise variance A + 
and the channel with additive noise V + W, where V is uniform on the sphere of radius y/nA 
and W is iid Gaussian noise with variance <r^. 

2. Let {Uk : A; = 1,2,..., K} be n x n unitary matrices generated uniformly from the set of all 
unitary matrices. Without loss of generality take U\ = I. 

3. The randomized code is uniform on the set {UkB : k = 1,2, .. . , K}. To send message i, the 
encoder draws an integer k uniformly from {1,2, . . . , K} and encodes its message as Uk^-i- 

4. The decoder knows k and chooses the codeword in Bk that minimizes the distance to the 
received vector y: 

0(y, k) = argmin ||y - U k Xj\\ . 
3 
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For all rates below (1/2) log(l+r/(A+o* )) we can choose the codebook B to have exponentially 
decaying probability of error |10U16t [TT] for both the AWGN channel with noise variance A + cr^, 
and the channel with additive noise V + W: 

e(B) < exp(-n£;(n- 1 logAf)). 

We can use this result to get a lower bound on the pairwise distance between any two codewords. 

Consider two codewords Xj and Xj from the set B. Let 7 > be half the distance between 
them: 

||xj - Xj|| = 27. 

Suppose that we transmit Xj over an AWGN channel with noise variance A + a^- Then the 
probability of error for message i can be lower bounded by the chance that the noise in the direction 
of Xj — Xj is larger than 7. Since the noise is iid, the error can be bounded by the integral of a 
Gaussian density [15] : 

e(i) > — / exp f — tttZ 2 ) dz 

W - V2vr(A + al)J~ 1 V \ 2(A + ^) J 

A + o?„ 
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exp(- 7 2 /2). 



Therefore there exists a fj, > such that for sufficiently large n we have 7 > (/i/2)y / n for some 
H > 0, which means that 

||Xj — Xj|| > fly/n. (6) 

We prove a more refined version of Theorem [TJ 

Theorem 2. Let K{n) be chosen such that K(n)/n — > 00 and n^ 1 \og(K(n)/n) —> 0. For input 
power constraint T, jammer power constraint A, and £ > there is an n sufficiently large and an 
(n, N, K{n)) randomized code for the GAVC of rate R < C r (T, A, cr^), where 

c r( r,A,^) = i,„ g (i + x J-j 

whose error satisfies 

e(n) = C " 



K(n) 



Proof. Fix a rate R < C r . We will suppress the dependence of K(n) on n in the proof. We need 
to show that for n sufficiently large, there exists a codebook B and K unitary matrices {Uk} such 
that the probability of error is bounded for any choice of s. To do this we first show that if s lies 
in a dense subset of the V nA sphere, then the event that the average error for K randomly chosen 
matrices {Uk} is too large has probability exponentially small in K. Therefore we can choose a 
collection that satisfies the probability of error bound for any s. 
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Consider the codebook 13 of N vectors from the sphere of radius y/nT. The expected performance 
of this code is good for an additive noise channel with noise V+W, where V is distributed uniformly 
on the sphere of radius \fnA. That is, for any 5 > there exists an n sufficiently large such that 

maxEv w W, V)] < exp(-nE(R)) = 5. (7) 

i€[N] 

Suppose that we sample K points Vi, V2, . . . , independently from the distribution of V. 
Then standard concentration bounds show that 

Pv,w N^5>(^V fe )>iJ 

< exp (-K(t log 5' 1 - h b (t) log 2)) . (8) 

A union bound over all i G [N] shows 



■ V„,w 



U {^X>> v *)>4 

; e [jVl I k=l ) 



< exp (-K(t log -T 1 - h b (t) log 2) + log N) . (9) 

Thus the probability that the collection of points {v m } induces an error probability that exceeds t 
is exponentially small in K. 

Now consider drawing K unitary matrices {£/& : k = 1, 2, . . . , K} uniformly. For a fixed v, the 
points 

v fc = u-\ 

are uniform samples from V, and = U^W are uniform samples from A/"(0, cr^J). Let {a m : 
m = 1,2,... M} be a set of vectors on the sphere of radius V nA. Another union bound yields the 
following: 



U U \^Y,e[i,U^)>t\ 

m=liG[JVl I k=l J 



i£[N] 

< exp (-K(t log 5' 1 - h b (t) log 2) + log M + log N) . (10) 

Results of Wyner [19] and Lapidoth [20J show that there exists a collection of exp(n(/9 + e)) 
points on the v 7 nA-sphere such that any point on the V nA-sphere is at most a distance r\ from one 
of the points, where 77 and p are related by p = (1/2) log (A/77 2 ). Choose M = exp(n(p + e)) and 
let {a m } be the corresponding rate-distortion codebook. The bound (fTUj) implies 



U U {^E^^)>4 

m=li<=\N] I k=l J 



< exp log 5- 1 - h b (t) log 2) + n(p + + e)) . (11) 
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If K(n)/n —> oo then the probability that the error is smaller than t for the M points {a m } can 
be made arbitrarily close to 1 for any r]. The next step is to argue that we can extend the bound 
from s G {a m } to all s. 

Because R < C r (T, A, <r^), for a sufficiently small constant v we have 

i / r 

R < - log 1 + 



2 b V (1 + ^) 2 A + o 2 w 

for some sufficiently small constant v. That is, we can choose our code to have small error proba- 
bility for noise of variance (1 + u) 2 A + a^. The bound in (jlip shows that for each message i there 
is a set /Cj of at least (1 — t)K keys for which 

||xj - Xj + (1 + v)U k l s. m \\ > (1 + v)\\a m \\ Vj / i 

Equivalently, we can write 

2(xj-Xi, C/- 1 ^) < ||xj - Xj || 2 . (12) 



Now suppose the jammer's input is s. For each k € /Q, the rate-distortion codebook property 

4" 



guarantees that s is only a distance rjy/n from some point U k 1 a m . We would like to prove a bound 



like {H]) for all s. To start: 

2 (xj - Xj, s) = 2 (xj - Xj, s - U^a, 



-in i 

-l. 



+ 2 ^Xj Xj, [7^, a r , 
r- 1 

< 2 • ||xj — Xj || • r]\/n + ||xi 



1 + f 



< I 2^ + ) | x, 

/i l + z/ / 



where we used (|12p . the Cauchy-Schwartz inequality, the distortion bound for {a m }, and and ([6]). 
Now choose rj sufficiently small so that 

2 (Xj - Xj, s) < ||xj - Xj|| 2 . (13) 

This shows that the minimum distance decoding rule results in a small error probability for all s 

1 1 2 

with ||s|| = nA. 

The last thing we need is to show that the average error probability is monotonic in the length 
of the jamming vector for a given direction. Suppose that there was an error for s but now the 
jammer inputs (1 + b)s. Then there is an error if 

||xj - x fc + (1 + 6)s|| < (l + 6)||s||. 

But we can easily bound this using the triangle inequality: 

||xj - Xfc + (1 + 6)s|| < ||xj - x fc + s|| + ||6s|| 
< (l + 6)||s||. 

Thus the error probability can only become smaller for shorter jamming inputs s. 

We have shown that for any t > there is an n sufficiently large such that with high probability, 
choosing a random set of K unitary matrices {Ufc} results in a randomized code whose error can 
be made smaller than t. Therefore such a randomized code exists. □ 
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Figure 2: The arbitrarily varying degraded Gaussian broadcast channel. The jammer is shared 
between both receivers. Since we assume the noise W2 has higher variance that wi, we call receiver 
1 the "strong" user and receiver 2 the "weak" user. 



2.4 An application to degraded broadcast 

We can apply Theorem [2] to the a degraded broadcast channel with a common jammer. In this 
setting we consider deterministic coding. We show that one receiver can use the codeword of the 
other receiver to enable randomized coding for its message. The channel model is given by 

Yi = x + s + Wi 
Y 2 = x + s + W 2 , 

where Wi is iid Gaussian with variance erf, W 2 is iid Gaussian with variance cr|, and a\ < a\. 
The channel is shown in Figure [2j We call receiver 1 the strong user and receiver 2 the weak user. 

An (n, Ni, N 2 ) deterministic code with power constraint T for this channel is a tuple of maps 
((f>,ipi,ip2), where 

: [iVi] x [N 2 ] -»• M n 
ifo. : W n -»• [Ni] 
ip 2 :R n ^ [N 2 ], 

and ||(/)(i,j)|| 2 < nT for all The map (f> is the encoder and the maps tp\ and ip 2 are the 

decoders for users 1 and 2. The average probability of error for the code under state constraint A 
is 

Ni N 2 

sG5"(A) JViiV2 t^f Z — ' 

% — 1 J 1 

The error is averaged over the messages to both users. We say the pair of rates (R\, R 2 ) is achievable 
if there exists a sequence of (n, exp(ni?i), exp(nR 2 )) deterministic codes whose average error goes 
to as n —7- 00. The capacity region is the union of achievable rates. 

The discrete arbitrarily varying broadcast channel without constraints was first studied by Jahn 
|21j . who proved an achievable rate region for randomized coding and then applied the elimination 
technique [22] to derandomize the code. This approach does not work in general for constrained 
AVCs. Discrete constrained AVCs with degraded message sets were studied by Hof and Bross [23] , 
Their achievable strategy requires a number of non-symmetrizability conditions which are analogous 
to our result in Theorem [3l 
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We build a superposition code [M] based on our rotated codebook construction. The strong 
user can treat the message for the weak user as a random key in a randomized code. The codebook 
for user 2 is a deterministic code ("cloud centers") with power aT and the codebook for user 1 is 
a randomized code with power (1 — a)T, where the randomization is over the codewords of user 2. 
From Theorem [2] we can see that the randomization provided by user 2's message is sufficient for 
user 1 to achieve the randomized coding capacity. This scheme is limited [14J to those a for which 



Theorem 3. If A > T then the deterministic coding capacity region of the arbitrarily varying de- 
graded Gaussian broadcast channel is the empty set. If A < T then for a S (A/T, 1], the rates 
( , i?2 ) satisfying the following inequalities are achievable with deterministic codes for the arbi- 
trarily varying degraded Gaussian broadcast channel under average probability of error: 



Proof. The converse follows from the converse for the standard AVC. Since we are limited to 
deterministic codes, if A > T the jammer can choose a message pair (i',f) and transmit <j)(i',j') 
plus additional noise. 

To show the achievable rate region, suppose that A < T and generate a codebook using Theorem 
[2] containing N2 = exp(rai?2) codewords {v,} on the v / noT-sphere. For each Uj generate N\ = 



exp(ni?i) codewords {vjj} uniformly on the a/ (re — 1)(1 — a)r-sphere. Let the overall codebook 
be: 



where Ai is an isometric mapping of M n to the plane orthogonal to Uj and Ui is a random 
unitary transformation as in the construction of Theorem [2 The codebooks satisfy the the power 
constraint. 

The weak decoder first decodes Uj, treating the signal AiUiVij as additional noise. From the 
results of Csiszar and Narayan [T3] we know that the average probability of error can be made 
small if aT > A. This gives the first rate bound. 

The strong decoder replicates the first step of the weak user. If message i was decoded correctly, 
it can subtract out Uj and the residual channel is identical to a GAVC with input power (1 — a)r 
using the codebook of Theorem [2j This gives us the second rate bound. 

To see the sum-rate bound (fl~6j) , note that the weak user can give up any part of its message to 
the strong user, which means that rate splitting between the points where a = A/T and a = are 
also achievable. □ 

A plot of the achievable rate region is shown in Figure El This achievable region is tight for 
a > A/T because the jammer could just add Gaussian noise to make the channel a degraded 
Gaussian broadcast channel [25-28J. The coding scheme above cannot be used in the regime where 
a < A/T because the jammer can symmetrize the {uj} codebook to the stronger user. At present 
we do not know if new achievable strategies can achieve higher rates in this regime or if different 
converse arguments can show that rate splitting is optimal. 



aT > A. 




(14) 



(15) 



(16) 
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Figure 3: Achievable rates for the degraded broadcast Gaussian AVC with T = 6, A = 1, a\ = 0.1, 
and u\ = 5. 
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Figure 4: The Gaussian arbitrarily varying channel with a known interference signal at the encoder. 



3 Dirty paper coding for AVCs 

In this section we turn to a different AVC model in which there are two sources of interference, one 
of which is known to the transmitter. The benefits of channel state information at the transmitter 
have been investigated by researchers since Shannon |29j. In one version of the problem, a time- 
varying state sequence is known non-causally at the transmitter, and the encoder can base its 
codebook on this known sequence. The capacity for discrete channels with iid state sequences was 
found in the celebrated paper of Gel'fand and Pinsker [30]. Costa [31] showed an analogous result 
for the Gaussian case and showed that the capacity is equal to that of a channel with no interference 
at all. His strategy is called a "dirty paper code." These results have found applications to inter- 
symbol interference (ISI) channels [32], watermarking [33], multi-antenna broadcasting [34], and 
models for "cognitive radio" [35 , 36] . We show that additional interference can increase the capacity 
of the GAVC when the encoder and decoder do not share any common randomness. 



3.1 Channel model 

We will consider channels with inputs and outputs in 1" of the form 

Y = X + T + s + W. (17) 

Here we take W ~ JV(0, o^I), ||s|| 2 < An, ||X|| 2 < Tn, and T ~ jV(0, of I). The channel input 
created by the transmitter is X, the vector T is interference known to the transmitter, s is jamming 
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interference, and W is the independent noise at the receiver. If randomized coding is allowed, then 
Costa's result implies that the capacity is equal to the AWGN capacity without T and the jammer 
treated as additional noise. 

An (n,N) code with power constraint T for this channel is a pair of functions (<fi,ip), where 
4> : [N] x E n R n and : R n [N] and 

\\<p{i,T)\\ 2 < nT a.s.. 

The average probability of error (over W and T) for this code with jammer power A is given by 

1 N 

e= max — V P (ip(<f>(i, T) + T + s + W) ^ i) . 
seS n (A) N f— ' 
1=1 

A rate R is achievable if there exists a sequence of (n, [exp(ni?)]) codes with e n — > as n — > oo. 
The capacity Cd is defined to be the supremum of all achievable rates. For of = this channel 
model reduces to the Gaussian AVC [14] whose capacity is given in ([5]). 



3.2 Main result 

Our main result is an achievable rate region for the Gaussian AVC with partial state information 
at the encoder that is achievable using a generalized dirty-paper code. For some parameter values 
the achievable rate is the capacity of the channel. One way of interpreting this result is that the 
presence of extra interference known to the transmitter boosts its effective power and therefore 
lowers the power threshold for the standard Gaussian AVC. 



Theorem 4. Let 



(a,p) : ^ == '— > A 

T + 2pa^/fa I t +a^ 2 t 



A{K) 



Pu = T + 2payjTaf + a'af 
P! = K + a 2 w 

P Y = T + Ip^Tol + a\ + A + o* . 
The following rate is achievable: 

R= max ilogf- J, 1 = V (18) 

In Costa's original paper, choosing p = and a = ao, where 

r , . 

r + a + 0-4 

gives an achievable rate of (1/2) log(l +r/(A + o^)). We have a simple corollary to show when our 
general scheme achieves capacity. 
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Corollary 1 (Capacity achieving parameters). IfT,A and o~1 are such that 



A < ff±2¥£, (20) 
r + a 2 af 



then the capacity of the channel ([T7]) under deterministic coding 

1 / r 

C d = - log 1 + 



2 b V A + a2 

and is achievable using the dirty paper code. 

There is a threshold on A making the capacity equal to 0. If the jammer can simulate both the 
known interference and the transmitter's strategy, then it can symmetrize the channel. The proof 
is straightforward and omitted. 

Lemma 1 (Naive outer bound). We have 

f A>(a t + Vf) 2 

~ 1 § log ( 1 + J otherwise 



3.3 Analysis 

Our codebook construction uses two auxiliary rates Rjj and -Rbin and will depend on parameters a 
and p to be chosen later and positive constants ei and e 2 that can be made arbitrarily close to 0. 

1. The encoder will generate an auxiliary codebook {Uj} of exp(n(Rjj — ex)) vectors drawn 
uniformly from the n-sphere of power Pjj. 

2. These codewords are divided randomly into exp(n(-R — 2ei)) bins {B m } such that each bin 
has exp(n(i?bi n + ei)) codewords. We denote the i-th codeword of bin B m by U(m,i). 

3. Given a message m and an interference vector T, the encoder chooses the vector U(m, i) G B m 
that is closest to /3T, where 

0> = *(i + 



a\ \ P,j-(l-p')r, 
If no such U(m, i) exists then we declare an encoder error. The encoder transmits 

X = U(m,i) -oT. 
We will show that for £2 > 0, we can choose n sufficiently large such that 

||x|| 2 <r (21) 



(U(m, i) - oT, T) > pyjTa? - e 2 . (22) 

4. The decoder first attempts to decode U(m, i) out of the overall codebook {Uj} and produces 
an estimate U(rh,i). It then outputs the estimated message index rh. 
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Figure 5: Geometric picture for dirty-paper encoding with general parameters. 

We will analyze the performance of this coding strategy on an AVC with for general p and a. 
Lemma 2. Suppose 



(23) 



Then for any e\ > and e% > in the code construction and any e' > 0, there exists an n sufficiently 
large such that 

P (3U(m, i) € B m : J2H), fcoirf) > 1 - e. 



Proof. Consider the picture in Figure [5] and let 



Pi 



u 



err 



1 + 



(i - P 2 )r 
iV-(i-p2)r 



We must show that a U(m, i) G /3 m exists satisfying (|22p . By the rate-distortion theorem for 
Gaussian sources, for i?bi n satisfying ([23j) . the codebook ^ m chosen uniformly on the sphere of 
power P\j can compress the source /3T to distortion 

Pu{l~P 2 )T 



D 



Pjj-il-p^T- 



To see this, consider the test channel (3T = U + V, where V is iid Gaussian with variance D. The 
mutual information of this test channel is 

-/ (/3 TAU) = -log(l + -£ 

1, I Pa 

= 2 lo H(T^)r;' 

We can choose U(m, i) to be the codeword in B m that corresponds to quantizing (3T. For any 
ei > the codebook /3 m has rate greater than the rate distortion function for the source /3T with 
distortion D, so there exists an e > such that with high probability, 



3T-XJ(m,iW <D-e. 



13 



For any 5 > we can choose n sufficiently large such that with high probability we have 
If we choose 5 small enough, we can find an r\ > such that with high probability, 



(U, T) > 



1 Pv°l 
Pu + D 



+ rj 



a|(P t/ -(l-p2 )r)+r? 



P\l Va t + aa t + V- 



Therefore with high probability we have 



(U - aT, T) > pJTaf + r,/2 



Therefore we can choose e 2 to satisfy (|22p . 

The last thing to check is that we can satisfy the encoder power constraint with high probability. 
Setting X = U — aT, we can see that for any 5' > 0, with high probability 



||X|r < Pu - 2a (U, T) + a 2 at + 5'. 
Choosing 5' sufficiently small yields ||X|| 2 < T — arj, which proves the result. 



□ 



Proof of Theorem \4\ We will choose the constants ei and 62 according to Lemma [2j The decoder 
must decode U(m, i) from the received signal Y: 



Y = U(i) + (1 - a)T + s + W. 



(24) 



The codebook {U^} can be used to achieve any rate below the deterministic coding capacity of the 
GAVC with input U, noise W + (1 — a)T, and jamming interference s, provided Pu > A. We can 
therefore choose Rjj to be equal to this capacity and for fixed a and p we calculate the capacity in 
what follows. 

We first find the power of the component of T that is orthogonal to U: 



(U, T) 
IIUII 2 



U+ T 



(U, T) 

nun 2 



u 



(25) 



From (|22p we see that for any 5 > we can choose n sufficiently large that 

2e 2 ) > 1 - S. 



(U, T) > pJTa, 



ao~ t 



Let Pt be the expected power in the second term of (I25p . Then for sufficiently large n we also have 



Pt < 



( (pV^oj + aa 2 t - 2e 2 



0+ 



Pi, 




> 1-5. 
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Some algebraic manipulation reveals that there is a constant c such that 



In the GAVC (124ft we define the equivalent noise variance as Pi + (1 - a) 2 P T . 

In order for U to be decodable, Rjj must be smaller than the capacity of the corresponding 
AWGN channel: 

Ru < - log 



2 6 V(i-«) 2 (i-p 2 )r^ 2 + P/Pc/ 



Then — i?bin gives the term to be maximized in ()18j) . Note that in the presence of a jammer 
with power constraint A, the U codebook is only capacity achieving if the received power in the U 
direction exceeds A. This received power is: 



T+(l + a)p v / T^+aa 



2 



V 



Thus for (a, p) 6 -4(A) the the GAVC threshold for the U codebook can be met and U can be 
decoded. Lemma [2] shows that for large n the encoding will succeed, so the probability of error can 
be made as small as we like. □ 



3.4 Examples 

Figure [6] shows an example of the achievable rate versus T. The two circles show the thresholds 
given by Corollary Q] and the threshold for the standard Gaussian AVC with deterministic coding 
and average error. The presence of the known interference T extends the capacity region relative to 
the standard AVC and achieves capacity for values of T that are smaller than the jammer constraint 
A. Thus far we have been unable to improve the converse for the region in which DPC does not 
achieve capacity; it may be that a different coding scheme exploiting the interference T can achieve 
higher rates in this regime. 

One application of dirty paper codes is in watermarking, in which an encoder must encode a 
message m in a given covertext (e.g. an image) which is modeled by an iid Gaussian sequence 
T. The encoder produces a stegotext V = <fi(m, T) + T that satisfies a distortion constraint 
||V — T|| = ||</>(m, T) 1 1 2 < r. A limited class of attacks are additive attacks, which take the form of 
an additive signal s that is independent of the stegotext V such that the receiver gets Y = s + V. 
In this model, if the encoder and decoder share common randomness, then standard Gaussian AVC 
results imply that the highest rate that can be transmitted via the stegotext is the dirty paper 
coding capacity with s equal to Gaussian noise. We can call this the randomized watermarking 
capacity. 

By contrast, if there is no common randomness we can use Theorem [J] with the noise W set to 
to find achievable rates for this problem under deterministic coding; a decoder should be able to 
read the watermark without sharing a secret key with the encoder. Because the encoder does not 
want to distort the covertext by too much, an interesting regime for deterministic watermarking 
is when the power <r| of the covertext is much higher than the distortion limit T of the encoder. 
From (|20p we can see that large a\ benefits the encoder by increasing the effective power of the 
auxiliary codebook to beat the jammer A. 
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Achievable rate versus r 

0.8 I i i ; 1 1 | | 




01 23456789 10 

r (transmitter power constraint) 



Figure 6: Rates versus F for A = 5, cr 2 = 2, and cr^ = 1. The solid line is the achievable rate 
and the dashed line is the outer bound. The dotted line is the AVC capacity without the known 
interference signal. The threshold for the dotted line is at T = A and the DPC threshold is given 
by (HI. 



By setting equality in ([20]) . p = 0, and a = ao, we can solve for a\. Let (3 = A/T be the ratio 
of the attack distortion to the watermark distortion. A little algebra reveals: 

*f =r Q/3(5 + 4/3)V2 

As we can see, the required a\ grows like /3 3 / 2 . Therefore for a fixed watermark distortion, the 
cover text variance must increase like A 3 / 2 in order to communicate at the randomized watermarking 
capacity. 



4 Rank-limited jammers 

In this section we study a model for multiple-input multiple-output (MIMO) Gaussian channels |37| 
under limited jamming. Hughes and Narayan |13j found the randomized coding capacity of an 
M x M MIMO AVC when the jammer has M antennas as well. A game theoretic model for this 
problem, carried out most fully by Baker and Chao [38J, uses the mutual information as a payoff 
between one player who can choose a transmit covariance matrix and another who can choose the 
noise covariance matrix. 

In this section we also consider fully randomized coding, but restrict the jammer to have only a 
single antenna. This means that the set of noise-plus-interference covariance matrices is no longer 
convex and requires new analysis. When we limit the jammer's degrees of freedom, characterizing 
capacity becomes more difficult. To illustrate our problem, consider the two possible configurations 
shown in Figure [3 A 2 x 2 MIMO system is subject to unknown interference from a single antenna 
system. Because the location of the interferer is not known prior to transmission, the MIMO 
system must choose a rate and coding scheme that will work regardless of the interferer's location. 
The fact that the interferer has a single antenna means that the interference lies in an unknown 
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Configuration 1 



Tx 



J 
J 



Int ¥ 



Configuration 2 




Figure 7: Two different configurations for a system with a single-antenna interferer. Because the 
location of the interferer is unknown, the subspace in which the interference lies may be unknown 
prior to transmission. 



one-dimensional subspace of the received signal. We show that this limitation can be exploited to 
achieve rates higher than the full-rank jammer [13]. To focus on these rank effects, we assume the 
encoder and decoder share common randomness. 

4.1 Channel model 

For simplicity, we will treat our MIMO channel as a vector Gaussian channel. Over a blocklength 
n, the channel is given by 

Y = X + sg T + W, (26) 

where X, Y, and W taking values in W nxM , g is an arbitrary unit vector in W M , the interference 
s is subject to the same average power constraint |s|| < nA, and each row of W is i.i.d. with 
distribution A/"(0, T,w), where is a positive definite M x M covariance matrix. The transmitter 

1 1 2 

is also subject to a sum power constraint ||X|| < nr. We can, without loss of generality, take 
the noise covariance matrix to be diagonal, so T*w = diag(cr|, a\, ■ ■ ■ , The interference is 

constrained to a rank-1 subspace, albeit an unknown one. We must therefore design a coding 
scheme that works for all values of g. We call such a channel an (M, M, 1) MIMO AVC. It is easy 
to generalize this model to (Mr, Mr, Mj) MIMO AVCs with Mt transmit antennas, Mr receive 
antennas, and Mj jamming antennas. 

An (n, N) deterministic code with power constraint T for this channel is a pair of functions 
O,^), where <j> : [N] -> R™ xM and V : M nxM -> [N] and 

U(i)\\ 2 < nT Vi€[JV]. 

An (n, N) randomized code with power constraint T is a random variable ($, taking values in 
the set of (n, N) deterministic codes. The maximal probability of error for a randomized code 
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under a rank-1 jammer with power A for the channel (|26p is 

e= max max maxP (\l/(<l>(i) + sg T + W) ^ i) . 

sGR":||s||<nA g GR M :||g||=l i£[N] 

A rate R is achievable under randomized coding and maximal error if there exists a sequence of 
(n, \exp(nR)~\) randomized codes whose maximal error goes to as n — > 00. The randomized 
coding capacity C r is the supremum of the achievable rates. 



4.2 Main Result 

In the case without the rank constraint on the interference, the jammer can also allocate power to 
all the degrees of freedom in this channel. This channel is equivalent to a vector Gaussian AVC |13| 
and the capacity for general M under randomized coding is known to be given by a "mutual 
waterfilling" strategy. Both the transmitter and jammer choose diagonal covariance matrices. The 
jammer chooses a covariance diag(Ai, A2, . . . , Am) by waterfilling over the noise spectrum: 

A* = max {A : (A- a 2 m ) + < a} (27) 
A m = {\*-a 2 m ) + . (28) 

The transmitter then chooses a covariance diag(ri, T%, . . . , Tm) based on this worst jamming strat- 
egy: 

7 *=max{ 7 : ( 7 -^-A m ) + <r} (29) 
T m = { 1 * -a 2 m -K m ) + . (30) 

Hughes and Narayan [13] showed that this allocation is a saddle point for the mutual information 
and is achievable for the Gaussian AVC with randomized coding. Later, Csiszar [39J showed that 
the capacity for deterministic codes is also given by this allocation if T > A. 

By treating the jammer as if it has M antennas, the capacity of the vector Gaussian AVC |13| 
is an achievable rate for the (M, M, 1) MIMO AVC model. 

Theorem 5 (Full rank jammer [E]). For the (M, M, 1) MIMO AVC, the following rate is achievable 
using randomized coding: 

M 1 / r A 

fl» ffl i = E2 lo H 1+ Arfsr)' (31 » 



in 



m=l 

where {r m } and {A m } are given by the waterfilling solutions in (|27[) — (j30|) . 

However, the rank constraint on the jammer should admit rates higher than -R w fin, since in 
many cases the jammer's waterfilling strategy does not satisfy its rank constraint. By examining 
the arguments of Hughes and Narayan |13| . we can find an achievable rate for this channel. If the 
transmitter fixes a covariance matrix first, then the following rate is achievable: 

p* . 1, det(S x + Ew + Agg T ) ,„ . 
R = max mm -log =r . (32) 

2x:tr(E x )<r g:||g||=l 2 det(S W + Agg T ) 
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Unfortunately, even the inner minimization is not convex in general, so standard optimization 
techniques are difficult to apply. If the max-min is equal to the min-max then this expression is 
the capacity. An optimistic upper bound on the capacity can be found by assuming the jammer 
adds A to the sub-channel with the weakest noise and then using the waterfilling solution for the 
transmitter, which proves the following theorem. 

Theorem 6. Suppose a\ < a\ < ■ ■ ■ < aj^ and let t\ = a\ + A and T m = for m > 2. Then for 

7* = max < 7 : (7 - T m ) + < T > 

I. m=l ) 

r m = (7 - r m ) + , 

the capacity of the (M,M, 1) MIMO AVC is upper bounded by 

M 



m=l v ' 



(33) 



Our main result is a characterization of the optimal strategies in (|32|) . The following theorem 
shows that the maximizing input covariance T,x is diagonal and the corresponding minimizing 
jamming strategy is to jam a single subchannel. 

Theorem 7. The input covariance matrix T,x maximizing the rate II Sty) for the (M, M, 1) MIMO 
AVC is diagonal. Suppose Sx = diag(ri,T2, . . . , T^f). Then the worst-case jamming direction g 
is equal to e m , where 

m = argmax — ^ -. (34) 

As an example, in some cases the same waterfilling allocation for the transmitter can achieve 
rates higher than (f31~j) . 



Corollary 2. Let o\ < o~\ < ■ ■ ■ < o~\,, and let {r m } be given by the waterfilling allocation in ([2 
and (j3"0j) . The following rate is achievable over the (M,M, 1) MIMO AVC: 

If A < a\ — a\ then this rate is equal to R u b in (|33p and is the capacity. If A > o~\ — o~\ then this 
rate is larger than R w m\ in- (JHJ. 



4.3 Analysis 

We begin with a simple technical lemma whose proof we include for completeness. 

Lemma 3 (Matrix Determinant Lemma). Let A be an M x M positive definite matrix and U and 
V be two M x k matrices. Then 

det(A + UV H ) = det(A) det(I fc + V H A- 1 U). 
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Proof. First, 



A -U ' 




A " 




V H I 




V H I 





I -A~ l U 
I + V H A- 1 U 



Taking determinants on both sides yields the result. 
Proof of Theorem^ We prove the second part of the theorem first. Let 

r , v s 1, det(S x + + Agg T ) 

M S X,g) = 77 log j— = — • 

2 det(S w + Agg J ) 

So we can use Lemma [3] to expand this: 

l lo det(S x + + Ag T (S x + ZwY^g) 

2 ° g det(S w )(l + Ag^S iy 1 g) 

Thus minimizing over g reduces to minimizing 



□ 



(36) 



J(g) = log 



l + Ag T (S x + £ r ^-< 



l + Ag^g 



sec: 



Let g = (\l 1 — gf,fh, §m) t , where h G M. AI 1 is a unit vector. Taking the gradient of J(g) we 

A 



VJ(g) 



l + Ag r (S x + S w )- 1 g 



r (s Jf + s ff )- 1 ( 



l + Ag^g 
We can write the function </(•) with respect to g 2 M : 

\ \ \ ( g M .. i v-M-Vi n 2 \ fe^ , 



log- 



Let 



O'l 



M-l 

£ 



/l 2 



+ 0~ m 



m=l m 
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Taking a derivative with respect to g 2 M : 



8 A ( —3 «i 

- j {9m) — 



A (i-« 2 ) 



l + A(f| + (l-^)a 2 ) 



l + Aai + A&(jr^-ai) 
Af^-a 2 ' 



1 + Aa 2 + (_|_ _ a2 ) 



The derivative is positive if 



?m+o- 2 m 



l + Aa 1 + A^( f ^--a 1 ) 



> 



(*—) 



1 + Aa 2 + Ag 2 M - a 2 ) ' 



or 



ai ) (1 + Aai) > ( -5- - a 2 ) (1 + Aa 2 ) 



Note that this condition is independent of of gM, so for any h, the optimal value of gu = 1 or 
gu = 0. In the first case, the jammer's optimal strategy is to choose g = ejw- In the second case, 
(yj 1 — gjrfh, gu) yields a higher rate than (h, 0) so repeating the argument on h shows that g = e m 
for some m 6 [M — 1]. Therefore the optimal jammer strategy is to pick g equal to an elementary 
vector. 

If g = e m is optimal for the jammer, then for any i ^ m, 

det(S x + T, W + Ae m e^) det(S x +% + Ae^ef ) 
det(£ w + Ae m e^) det(T, w + Ae»ef ) 

or 

(r m + a 2 m + A)(r, + a, 2 ) ^ (r, + a 2 + A)(T m + a 2 .) 



< 



K 2 „ + A)a 2 ^ {a 2 + K)a 2 m 

Some algebra reveals that 

(r m + a 2 m + A)(ri + a 2 ) (a 2 + A)<4 

< (r< + a 2 + A)(T m + <4)(<4 + A)a 2 , 
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from which it follows that 



(T m + a 2 m + A)(r>f<4 + TialA + a 2 a 2 m A + afa 2 J 

< (Ti + a 2 + A)(T m o 2 a 2 m + T m a 2 A + a 2 o 2 m A + of <r. 



and finally that 

^i^mO'm + Tjcr^ + Tjcr^A < rjF m cr 2 + r m of + r m cr 2 A. 
So for a given S x , the optimal jamming direction is e m if 



r m + a 2 m + A Ti + a 2 + A 
for all % 7^ m. 

Now consider L(Sx,g) in (|36|) and suppose Sx is arbitrary. Let S x = diag({(Sx)mm}) be 
diagonal matrix containing the diagonal of Sx- Then by Hadamard's inequality, for any m, 

rrv° o ^ rrv o ^ 1 , det(£^ + £ w + Ae m e£) 
L(S x ,e m ) - L(Sx,e m ) = - log ■ 



2 * det(Sx + Z w + Ae m e^; 
> 0. 



Let m° = argmin ?n L(E X , e m ). Then 

minL(Sx,g) < L(Sx,e TO o) 
g 

< L(S x ,e m o) 

= minL(S x ,g), 
g 

so the transmitter can always increase the rate by choosing Sx to be diagonal. □ 

Proof of CorollarylM Let Ti > T2 > • • • > Tm be waterfilling power allocation, and let 7 = a 2 + T\ 
be the "water level." For this allocation, it is clear that 

w r,/. ? 



Ti + + A ^ + 0/ + A 

for all i 7^ 1, so by Theorem [71 the worst-case direction g for the jammer is g = ei and (135[) is 
achievable. If A < cr 2 — cr 2 then this is the optimal strategy for the full-rank jammer so (|35p is the 
capacity. If A > cr 2 — a\ then comparing this to -R w mi we see that R > -R w mi- □ 

4.4 The (2, 2, 1) MIMO AVC 

For a given diagonal covariance matrix, Theorem [7J shows that the jammer's optimal strategy is 
always to jam one of the subchannels. The set of covariance matrices for which the optimal jamming 
direction is e m is given by (|34p . Unfortunately, maximizing the rate subject to the conditions in 
(|34p does not lead to a clean solution. In the (2, 2, 1) MIMO AVC we can carry out the calculation 
explicitly. 
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Theorem 8. Let (3 be the value ofV± for which the terms in the maximization (|34p are equal: 



p/o-l (r - a)lo\ 



/3 + af + A r-/3 + a 2 2 + A' 



let 



1= l -{T-{al + K-al)), 



and let 




r — a 



) 



Then for the (2, 2, 1) MIMO AVC, 

• if o\ + A < o~2 then R{^i) is the capacity, 

• if a\ + A > a\, Y > a\ + A — o"| and 7 > /3 i/ien -^(7) is i/ie capacity, 

• i/ erf + A > and r < a\ + A — cr^ or 7 < /? i/ien -R(/3) is achievable. 

Proof. Note that the optimal jamming strategy is g = ei for T\ > f3 and g = e2 for Ti < (3. 
Suppose first that o~\ + A < a\. In this case, the waterfilling power allocation for noise spectrum 
(a\ + AjUg) is such that T\ > (3, so the capacity is given by ([3T]) . coinciding with Theorem [6l 

Now suppose that erf + A > a\. If T > + A — a\, then we consider two sub-cases. If the 
waterfilling power allocation (7, F — 7) for noise spectrum (erf + A, <r|) satisfies 7 > /3, then that 
power allocation is optimal and the rate is given by R(j)- However, if 7 < f3, then setting T\ = ft 
is optimal. For T\ < f3 the optimal jammer strategy is e2, but the achievable rate is monotonically 
increasing in T\. For T\ > (3 the optimal jamming strategy is &2 and the rate is monotonically 
decreasing in T\. Hence T\ = f3 \s optimal. □ 

From this result we can see the difficulty in (j32l) . Suppose r = 6, a\ = 3, erf = 1, and 
A = 4. If g = (0, then the waterfilling solution for a channel with Gaussian noise of covariance 
T,\y + Agg T = diag(3, 5) is to choose Sx = diag(4, 2). However, for this choice of Sx the optimal 
g = (1, 0) T . What this shows is that the max and min in f)32[) cannot be reversed. In two cases the 
waterfilling allocation is optimal, but in the other cases the transmitter chooses its power allocation 
and rate such that the jammer can jam either subchannel. The transmitter is forced to choose a 
power allocation different from Theorem [6] even when the jammer has a rank constraint. 

Figure [8] shows -R w fm and the rate given by Theorem [8] as a function of the interference power A. 
The curves are equal until the point where A = a\ — a\. For such values, the jammer can realize the 
waterfilling strategy. However, for larger values of A, the rank constraint on the jammer prohibits 
waterfilling across multiple channels. Under a diagonal input covariance the optimal strategy is to 
allocate all of A to a single channel. For large interference powers, the rank constraint allows the 
transmitter and receiver to communicate at rates strictly higher than R w fni- 

We can also examine the asymptotic behavior of the capacity as A — > 00. The optimal jamming 
strategy is still to jam the less noisy channel, so the noise-plus- interference spectrum becomes more 
and more unbalanced. Clearly the subchannel with noise af + A contributes no rate to the capacity 
in the limit. However, any power in the first subchannel will still contribute. As A — > 00 the 
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Rate vs. A for = 3, = 1 , r = 4 




1 2 3 4 5 6 7 8 

A (interference power) 



Figure 8: An example of the achievable rates versus the interference power A for o~\ = 1, cr| = 3, 

r = 4. 

limiting behavior is given by the threshold condition in (|34p . The optimal power allocation is given 
by 

£i = r_2 

Corollary 3. For the (2,2,1) MIMO AVC with a\ < a\, the following rate is achievable in the 
limit as A — >• oo: 

fl=ilog(l+ J 2 ) . 
2 V ^1+^2/ 

Note that this rate is less than i? u b = \ log ^1 + -J^J in (f33|) . We conjecture that this loss with 
respect to the optimal strategy knowing the jammer's strategy is inherent due to the adversarial 
nature of the channel. 

A different regime is when T and A go to oo while keeping the ratio p = T/A fixed. 

Corollary 4. For the (2, 2, 1) MIMO AVC with a\ < a\, ifT, A -> oo with fixed p = T/A then the 
achievable rate scales according to 

i?( P ,r) = o(iogr) + iiog(i + £). 

The results here are a first step towards understanding the effect of rank-limited uncertainty 
in interference for MIMO systems. There are a number of interesting open questions for future 
work. Firstly, finding the optimal rate in Theorem [7] requires optimizing over different sets of 
power allocations corresponding to different optimal jamming strategies. We conjecture that the 
optimal rate always corresponds to the jammer setting g = ei, where a\ is the smallest noise 
variance. Secondly, showing that this rate is indeed the capacity of the MIMO AVC requires 
different techniques than the full rank case, where the jammer and transmitter strategies formed 
a saddle point. Finally, extending these results to more general numbers of transmit, receive, 
and jamming antennas would be very interesting and may shed some light into other problems in 
rank-limited optimization. 
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5 Conclusion 



In this paper we investigated some variations on the basic Gaussian AVC model to illustrate different 
aspects of coding for worst-case interference. One way of interpreting these results is as intermediate 
stages between worst-case and average-case analysis. The worst-case interference in the GAVC can 
depend on the codebook of the transmitted message. However, with additional resources, this worst- 
case behavior can be relaxed to attain rates closer to the average-case behavior. We demonstrated 
that a very small amount of common randomness is sufficient to achieve the randomized coding 
capacity of the GAVC, that a known interference signal can help mask the codeword and allow 
reliable communication with moderate interference, and that extra degrees of freedom can overcome 
even worst-case interference. 

There are still several open questions that remain. Is the O(logn) bits of common randomness 
necessary to achieve the randomized coding capacity? Finding lower bounds on the amount of 
randomness can quantify how close the worst case is to the average case. Is the dirty-paper coding 
scheme optimal? We conjecture that it is, in the sense that the encoder cannot exploit the known 
interference beyond the geometric approach that we describe. For the general MIMO AVC with 
rank-limited jammer is there a simple characterization of the optimal transmitter and jammer 
power allocations? We conjecture that the worst-case interference jams the strongest sub-channel. 

Our results show that the analysis of coding schemes robust to unknown interference is highly 
dependent on the resources available to the encoder and decoder. The arguments here are geometric 
in nature, and it may be interesting to pursue the relationship between worst-case and average-case 
structures for other high-dimensional problems. 
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